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Method and System for Configuring RAID Subsystems 
With Block I/O Commands and Block I/O Path 



Field of the Invention 



This invention relates generally to the field of disk storage subsystems, and more 
particularly to configuring redundant arrays of independent disks (RAID) 
5 subsystems, 

D 

O Background of the Invention 

if] 

tap 

0 Most modern, mid-range to high-end disk storage subsystems are arranged as 
|D redundant arrays of independent disks (RAID). A number of RAID levels are 
u known, RAID-1 includes sets of N data disk drives and N mirror disk drives for 
J storing copies of the data disk drives. RAID-3 includes sets of N data disk drives 
X: and one parity disk. RAID-4 also includes sets of N+l disk drives. RAID-5 
distributes parity data across all disk drives in each set of N+l disk drives. 

15 

A RAID subsystem must be configurable. That is, the arrays, which perform I/O 
operations for users, must be created, managed, and deleted. Many other functions 
of the RAID subsystem must also be performed, such as on-line expansion, RAID 
level migration, and assigning spare space. A configuration description of the 
20 arrays must also be readable, and configuration applications must be able to 

monitor the arrays during operation. For that reason, a configuration interface and 
module are included with all RAID subsystems. 
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Figure 1 shows components of a computer system 10 including a typical prior art 
RAID subsystem 100. The RAID subsystem 100 includes a RAID configuration 
module 101, and one or more RAID user data arrays 102 each with their own array 
Identification (ID) 104. User applications 111 use a block I/O path 121 to read and 
5 write, generally "access/' user data of the RAID arrays 102 using the ID 104 
assigned to each array. Physically, the RAID arrays 102 store the user data on 
block storage devices 130. 

Ji; The problem with the prior art RAID subsystems is that a configuration application 
W 110 must use a specialized configuration path 120 to communicate with the 

"T'~ 

lj configuration module 101 to perform the configuration functions described above. 

m 

Figure 2 shows the layers of the block I/O path or calling stack 121 used for user 
u data transfer. The software and hardware of the block I/O path 121 includes, from 
Js the top host side to the bottom client side, an operating system (O/S) - application 
m interface 201, an O/S driver interface 202, an I/O driver 203, a host transport 

interface 204, an I/O transport 205, a client or external transport interface 206, a 

transport driver 207, and a RAID subsystem interface 208. 

20 The interface 201 translates I/O requests executed by the user applications 111 for 
the O/S driver interface 202. The I/O driver 203 translates specific O/S calls for the 
transport layer 205, such as PCI, iSCSI, Infiniband, SCSI, or fibre-optic channel, 
or some other high bandwidth bus. The transport driver 207 communicates with the 
RAID subsystem interface 208, which in turn communicates with the RAID 

25 subsystem 100. 
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The block I/O path 121 is relatively straightforward because most operating 
systems and buses use the same basic functions. Only minor differences are found 
among operating system interfaces. Typically, most of the layers of the stack that 
form the block I/O path 121 are well defined. The O/S - application interface 201 , 
5 the O/S driver interface 202, transport interfaces and transport 204-206 are 
typically standardized. Some of the drivers and interfaces may be specialized or 
proprietary. For instance, most RAID subsystems require a specially designed I/O 
driver 203, transport driver 207, and RAID subsystem interface 208. 

Ii However as shown in Figure 3, the layers of the stack that form the configuration 

U1 path 120 generally have far more specialized components than the block I/O path. 

Q A configuration interface 301 to the configuration application 110 is typically a 

a proprietary set of IOCTLs (I/O control commands) that need to be designed and 

y, defined. A configuration driver 302 implements these specialized IOCTLs to allow 

B control of the RAID subsystem 100. The transport layers 303-305, which also can 

n 

T use SCSI, fibre, or PCI buses, have specialized block I/O commands, and usually 
the specialized transport interface 303 is customized. For SCSI buses, for instance, 
the transport interface 303 uses mode pages with devices. In fact, many RAID 
subsystems even include a separate third physical path, such as Ethernet, or serial 

20 ports to implement the transport layers 303-305. The driver or external side 

transport interface 305 also uses specialized primitives that communicate with a 
specialized transport driver 306, and of course a specialized RAID configuration 
interface 307. 

25 In all RAID subsystems, the configuration interface 301, configuration driver 302, 
host side configuration transport interface 303, client side configuration transport 
interface 305, transport driver 306 and RAID configuration interface 307 are 
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specialized. Additionally, many RAID subsystems even have a specialized 
configuration transport 304. 

Therefore, current RAID subsystems have a severe problem. The configuration 
5 portions perform many functions, yet these are not pre-defined. Every operating 
system requires a re-implementation of the configuration functions and interfaces, 
and every configuration application needs specialized software, and perhaps, 
access to specialized hardware. This increases the cost and complexity of the 
Jr computer system 10. 

f 

W Even if the configuration stack 120 were as easy to implement as the block I/O 

5 stack 121, it is still requires a completely separate implementation to communicate 

T with the configuration module 101 . A lot of effort is required to make this possible. 

S In addition, the configuration path 120 often has many limitations with respect to 

|5 the number of commands and the amount of information that it can process. 

Therefore, there is a need for a RAID subsystem that does not require a specialized 
configuration path so that the cost and complexity of the entire computer system 
can be reduced. 

20 

Summary of the Invention 

The invention provides a RAID subsystem that uses only a block I/O path for 
accessing both RAID user data arrays and a configuration array. A configuration 
25 application uses the same block I/O path as the user application to issue commands 
to a configuration array, and to receive data and status in response to these 
commands. 
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In fact, to the configuration application, the configuration array appears as just 
another RAID array. The configuration array is distinguishable from a user array 
only by its associated configuration array identification (ID). The configuration 
5 array ID enables the configuration application to locate the configuration array on 
the RAID subsystem. 

Unlike the user data array, the configuration array does not store user data. Instead, 

M 1 the configuration array receives configuration commands, processes these 

o 

W commands, and supplies the configuration application with configuration 

ffi information in response to the configuration commands. The configuration array 

Jr 

5 ID is accessed with block I/O commands, and uses block I/O data. 

Hi : : 

u More particularly, a configurable RAID subsystem includes a user data array 

g connected to a user application via a block I/O path, and a configuration array 

H connected to a configuration application via the same block I/O path. 

The user data array processes user data access commands executed by the user 
application, and the configuration array processes configuration commands. The 
20 user data access commands and the configuration commands are communicated to 
the user data array and the configuration array respectively, via the block I/O path. 

A dynamic identification is assigned to the user data array by the configuration 
array, and a static identification is assigned to the configuration array. 

25 
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Brief Description of the Drawings 

Figure 1 is a block diagram of a prior art RAID subsystem with a block I/O path 
and a specialized configuration path; 

5 

Figure 2 is a block diagram of the layers of the block I/O path of Figure 1; 

Figure 3 is a block diagram of the layers of the specialized configuration path of 
N : Figure 1; 

m 

Uni Figure 4 is a block diagram of a RAID subsystem including a configuration array 
D according to the invention; 

is 

p Figure 5 is a flow diagram of the steps performed by the configuration application 
J|> to execute write commands to the configuration array of Figure 4; 

Figure 6 is a flow diagram of the steps performed by the configuration application 
to execute read commands to the configuration array of Figure 4; 

20 Figure 7 is a flow diagram of the steps performed by the RAID subsystem to 
process configuration write commands according to the invention; and 

Figure 8 is a flow diagram of the steps performed by the RAID subsystem to 
process configuration read commands according to the invention. 

25 
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Detailed Description of the Preferred Embodiments 
System Structure 

5 Figure 4 shows a computer system 40 including a RAID subsystem 400 according 
to the invention. The RAID subsystem 400 includes one or more user data arrays 
102. A user application 111 accesses the user data arrays 102 via a block I/O path 
121, and the arrays 102 process all block I/O commands received via the block I/O 

M path 121 from the user application 111. The RAID subsystem 400 uses block 

IB devices 130, for example, disk drives, to physically store user data. 

y] 

Q The RAID subsystem 400 according to the invention also includes a configuration 

§ 

, array 401. A configuration application 410 also uses the block I/O path 121 to 

2 access the configuration array 401. In the RAID subsystem 400 according to the 

|i invention, the configuration array 401 appears just like any of the user arrays 102, 

Jy{ except for its predetermined and static configuration identification 402. 

Each array 102 exported by the RAID subsystem 400 also has an associated user 
identification 104. The dynamic user identifications 104 of the user arrays 102 are 
20 part of the configuration information assigned and provided by the configuration 
array 401 that has the static configuration ID 402. The identifications can be stored 
in a memory of the RAID subsystem 400. 

With this arrangement, only one set of drivers needs to be implemented for the 
25 application 410 and 111, because all communications with the RAID subsystem 
400 are via the single block I/O path 121. That is, the configuration application 410 
according to the invention uses the same call stack 121 as the user application 111. 
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However, the configuration array 401 has a static identification 402, while the user 
arrays 102 have dynamic identifications 104 that are assigned when the arrays are 
created by the configuration application 410. 

Because the block I/O path usually includes a hardware network such as Ethernet, 
or buses such as iSCSI, SCSI, and fibre-optic, the configuration application 410 
only needs to include primitive I/O command processing for the operating system 
on which it executes, regardless of whether the RAID subsystem 400 is local or 
external to the computer system. 

For an external RAID subsystem, the arrangement according to the invention is 
simple to implement. It does not matter what type of network or bus, internal or 
external, is used to connect to the RAID subsystem 400. All that is needed is the 
block I/O path 121 with whatever hardware and software layers that are used to 
implement it. After the configuration application 410 has determined the 
configuration ID 402, it can direct all configuration commands to the configuration 
array 401 via the block I/O path 121, and not via a separate specialized 
configuration path as in the prior art. 

User Block I/O Command Execution and Processing 

Block I/O user data access commands typically include write and read commands 
for specific blocks within the user arrays 102. When a write command is 
processed, the only information that is sent back to the user application 111 is a 
status that indicates whether the command was processed successfully, or not. A 
read command will also provide user data. In prior art RAID arrays 102, all 
commands are processed completely independent of each other. That is, the 
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processing of a read command is done without any knowledge of a prior write 
command, for example. This makes the implementation of the configuration array 
401 using only write and read commands non-obvious. 

5 Configuration Write Command Execution 

Figure 5 shows the steps of configuration write command execution according to 
the invention. In this case, a configuration write command is assembled 510 by the 
[l configuration application 410. Then, the configuration application 410 writes to a 
M "block" 520 of the RAID subsystem 400 using the block I/O path 121 and the 
ill configuration ID 402. The word "block" is used loosely here to emphasize that the 

C configuration array 401 appears like any other array to the configuration 

m 

■ application, although, in fact, it implements configuration functions of a 

w 

u configuration module. After processing the write command, the RAID subsystem 

J5 uses a universally available block I/O status mechanism to return success or failure 

|j 530 to the configuration application 410. 

Configuration Read Command Execution 

20 Figure 6 shows the steps of configuration read command execution according to 
the invention. In this case, the configuration application 410 requests to receive 
information pertaining to the status or structure of the RAID subsystem 400. 

To execute a configuration read command, the configuration application first 
25 obtains an application ID. The application ID is used by the configuration array 
401 to track commands executed by a specific configuration application should 
multiple configuration applications be executing concurrently. 

9 
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Therefore, the configuration application reads a predetermined "block" 610 of the 
configuration array 401, e.g., block 0. When the configuration array 401 processes 
a read command for this block, it returns a unique application ID for the current 
5 configuration application 410. From that time on, the configuration array 401 can 
track commands executed by that specific configuration application with the 
associated application identification. 

£ For subsequent read commands, the configuration application 410 assembles 620 a 

J© request to read a configuration information data structure that will convey to the 

HI RAID subsystem the specific configuration information that is requested. The 

Q configuration application then writes 630 that request to the configuration array by 

as 

3 actually executing a block I/O write for the block with the application ID. If the 
p return status is error 631, the command is reissued. 

tt 

S Otherwise, if the write command completes successfully 632, the configuration 
application next issues a block I/O read 640 to the configuration array 401 to 
receive the requested information from the RAID subsystem by reading the 
"block" in the configuration array 401 with the number that is equal to the 

20 application ID. 

The configuration array then uses the application ID to identify the requested 
configuration information and returns the information with appropriate status 650 
to the requesting configuration application. 

25 
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RAID Subsystem Configuration Array Implementation 

The RAID subsystem 400 receives block I/O write and read commands for the 
configuration array 401 via the block I/O path 121. The configuration array 
processes the commands to create or delete arrays in the RAID subsystem. The 
configuration array can also perform all other necessary functions to manage the 
RAID subsystem, such as, on-line expansion, RAID level migration, and assigning 
spare space. A configuration description of the arrays is also readable, and 
configuration applications can monitor the performance of the user arrays during 
operation. 

Processing Configuration Write Commands 

Figure 7 shows the steps for processing configuration write commands according 
to the invention. After the configuration array receives a block I/O write command 
710, associated internal data structures within the RAID subsystem are locked 720. 
Typically, these data are locked to prevent multiple configuration applications 
modifying the same data at the same time, and also to prevent user access to 
portions of the user data arrays while reconfiguration is performed. 

The configuration write command is then processed 730 and the appropriate status 
is set. For example, a new array is created, or an old array is deleted. Then, the data 
structures are unlocked 740, and the status is returned 750 to the configuration 
application. 
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Processing Configuration Read Commands 

Figure 8 shows the steps for processing configuration read commands. For the first 
use of each application, the application must acquire an application ID. The 
5 configuration device, will, therefore get a read once 805 from any new application 
to the predetermined block that returns application IDs. That read will cause a new 
ID to be generated and returned to the application. 

»f Next, a configuration read command with a read configuration information data 
W structure is received 810 by a block I/O write the configuration array. This means 
Iff that the configuration application will subsequently also execute a deferred read 

ass; 

O command. The specific read command, which includes the application ID, is then 
7 stored 820 within the RAID subsystem. 

jji When the subsequent block I/O read is received 830 by the configuration array, the 
Si block number, i.e., application ID, is compared with any number of pending read 
command/application ID pairs that were stored. When a match pair is found, the 
appropriate data structures within the RAID subsystem are locked 840, and the 
requested data are filled 850 into the application read buffer. After the read buffer 
20 is filled, the RAID subsystem data structures are unlocked 860, and the read 
information is returned 870 to the configuration application along with a return 
status. 

Detailed descriptions of the preferred embodiment are provided herein. It is to be 
25 understood, however, that the present invention may be embodied in various 
forms. Therefore, specific details disclosed herein are not to be interpreted as 
limiting, but rather as a basis for the claims and as a representative basis for 
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teaching one skilled in the art to employ the present invention in virtually any 
appropriately detailed system, structure or manner. 
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